Visualizing breast cancer data with t-SNE
ثبت نشده
چکیده
One in eight women will get breast cancer in her lifetime and in 2008 it has caused 458.503 deaths among the world [15]. Despite that technology has made considerable improvements in the last decades, there is still room for more advances. A technique that possibly can contribute to this field is t-SNE [24]. The aim of this thesis is to investigate whether t-SNE is able to present the breast cancer data in an interpretable way and possibly improves the classification performances. We employ two approaches to explore the applicability of t-SNE. In the first approach we compare the visualizations and in the second approach the classification performances are compared. We found that classification on the original data performed significantly better than on t-SNE data. This suggests that t-SNE is not applicable to the breast cancer data set.
منابع مشابه
Visualizing Data using t-SNE
We present a new technique called “t-SNE” that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map. The technique is a variation of Stochastic Neighbor Embedding (Hinton and Roweis, 2002) that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map. ...
متن کاملGraph Layouts by t-SNE
We propose a new graph layout method based on a modification of the t-distributed Stochastic Neighbor Embedding (t-SNE) dimensionality reduction technique. Although t-SNE is one of the best techniques for visualizing high-dimensional data as 2D scatterplots, t-SNE has not been used in the context of classical graph layout. We propose a new graph layout method, tsNET, based on representing a gra...
متن کاملData-driven identification of prognostic tumor subpopulations using spatially mapped t-SNE of mass spectrometry imaging data.
The identification of tumor subpopulations that adversely affect patient outcomes is essential for a more targeted investigation into how tumors develop detrimental phenotypes, as well as for personalized therapy. Mass spectrometry imaging has demonstrated the ability to uncover molecular intratumor heterogeneity. The challenge has been to conduct an objective analysis of the resulting data to ...
متن کاملVisualizing Time-Dependent Data Using Dynamic t-SNE
Many interesting processes can be represented as time-dependent datasets. We define a time-dependent dataset as a sequence of datasets captured at particular time steps. In such a sequence, each dataset is composed of observations (high-dimensional real vectors), and each observation has a corresponding observation across time steps. Dimensionality reduction provides a scalable alternative to c...
متن کاملافزایش غلظت پلاسمایی و میزان MMP-9 فعال در بیماران متاستازی سرطان پستان و ارتباط آن با وجود آلل T در پروموتور این ژن
Background and purpose: Matrix metalloproteinase are a family of proteolytic enzymes that have specific functions in digestion of cells cohesive extra cellular matrix and also, increasing metastasis behavior of acute human tumors. It has been reported that MMPs in two forms, namely proenzyme and active enzyme in biological samples. It is distinguished that among this family, (MMP-9) Matrix Me...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013